CDS

Accession Number TCMCG033C14020
gbkey CDS
Protein Id TQE00398.1
Location complement(join(379059..379178,379900..380002,380138..380351,385542..385740,386348..387355))
Organism Malus baccata
locus_tag C1H46_013998

Protein

Length 547aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA428857, BioSample:SAMN08323692
db_source VIEB01000215.1
Definition hypothetical protein C1H46_013998 [Malus baccata]
Locus_tag C1H46_013998

EGGNOG-MAPPER Annotation

COG_category I
Description 4-coumarate-coa ligase
KEGG_TC -
KEGG_Module M00039        [VIEW IN KEGG]
M00137        [VIEW IN KEGG]
M00350        [VIEW IN KEGG]
KEGG_Reaction R01616        [VIEW IN KEGG]
R01943        [VIEW IN KEGG]
R02194        [VIEW IN KEGG]
R02221        [VIEW IN KEGG]
R02255        [VIEW IN KEGG]
R06583        [VIEW IN KEGG]
KEGG_rclass RC00004        [VIEW IN KEGG]
RC00131        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01904        [VIEW IN KEGG]
EC 6.2.1.12        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00130        [VIEW IN KEGG]
ko00360        [VIEW IN KEGG]
ko00940        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00130        [VIEW IN KEGG]
map00360        [VIEW IN KEGG]
map00940        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGAGCACCATCGCAAAGATGATGAGTTCATTTTCCGGTCCAAACTCCCCGATATTTACATCCCAAACCACCTCCCTCTTCACACCTACTGCTTCGAAAACATCTCCCAATTCATGGACCGCCCCTGCTTGATCAACGGCAACAACGGCGACACCTTCACCTACGCCGACGTCGAGCTCACCTCCCGCAAGGTCGCCTCGGGGCTCCACAAAATCGGCATTCACCAAACCGACGTCATCATGCTCCTCCTCCAAAACTGCCCTGAATTCGTCTTTGCATTTCTCGGCGCCTCCAATATCGGCGCAGTCGTCACCACTGCCAACCCCTTCTACACTCCGGCCGAGATGGCCAAGCAGGCAAAAGCATCCAATGCCAAACTCATCATAACGCAGTCGGCTTACGTGGACAAGGTGAAGGACTTCGCACTTAAAAACGACGTCGAGATCATGGTCGTCGACAGCGCGGAAACTGAGGAAGACGGTAATACTTATCGTCACTTCTCGGAGATGACTTCGGCGGACGAGAATGACATCCCGGCGGTGAAAATAAACCCCGAAGACGTCGTTGCGCTGCCGTATTCTTCTGGGACGACGGGGCTACCTAAAGGAGTTATGCTGACCCACAAAGGGTTGGTGACGAGCGTGGCGCAACAGGTGGACGGAGAGAATCCGAATTTGTATTTCCACAGTGAGGACGTGATCCTCTGCGTGCTGCCCTTGTTCCATATCTACTCCCTCAATTCAGTGTTTCTCTGCGGACTCAGAGTTGGGGCGGCGATACTGATCATGCAGAAGTTTGAGATCACCAAGTTGTTGGAGCTGGTGGAGAAGTACAAGGTGACGATTGCGCCTTTTGTACCTCCGATCGTTTTGAGTATTGCCAAAAGCCCCGACTTAGACCGGTACGACTTGTCATCGATAAGGATGGTGATGTCCGGTGCGGCGCCGATGGGGAAGGAGCTTGAGGATACAGTGAGGGCTAAGTTACCTAATGCCAAACTTGGACAGGGGTATGGAATGACAGAGGCTGGACCTGTGCTGTCAATGTGCTTAGCATTTGCAAAGGAACCATTTGAGATAAAATCAGGTGCGTGCGGGACTGTTGTAAGAAATGCAGAGATGAAAATTGTTGACCCTGATACGGGTGCTTCGCTTCCGCGAAATCAAGCTGGAGAGATTTGCATCAGAGGTAGCCAAATCATGAAAGGTTATCTTAATGATCCTGAAGCCACGGAGAGAACCGTAGACAAACAGGGATGGTTGCACACAGGTGATATAGGGTGCATCGACGGTGATGACGAGCTCTTCATCGTCGACCGATTAAAGGAACTCATCAAATACAAAGGGTTCCAAGTGGCTCCGGCTGAGCTTGAAGCCATGCTGATTGCCCATCCCAACATCTCCGACGCTGCTGTTGTACCTATGAAGGATGAAGCTGCAGGTGAAATTCCTGTTGCATTTGTTGTGAGATCGAACAGTTCCAAGATCTCCGAAGATGACATCAAACAATACATCTCAAAACAGGTGGTCTTTTATAAGAGAATAGGTCGGGTTTTCTTCATAGACAAAATACCCAAGGCTCCTTCTGGCAAAATCTTGAGAAAAGACTTGAGAGCAAAGCTGGCTGCAGGCCTACCCAATTAG
Protein:  
MEHHRKDDEFIFRSKLPDIYIPNHLPLHTYCFENISQFMDRPCLINGNNGDTFTYADVELTSRKVASGLHKIGIHQTDVIMLLLQNCPEFVFAFLGASNIGAVVTTANPFYTPAEMAKQAKASNAKLIITQSAYVDKVKDFALKNDVEIMVVDSAETEEDGNTYRHFSEMTSADENDIPAVKINPEDVVALPYSSGTTGLPKGVMLTHKGLVTSVAQQVDGENPNLYFHSEDVILCVLPLFHIYSLNSVFLCGLRVGAAILIMQKFEITKLLELVEKYKVTIAPFVPPIVLSIAKSPDLDRYDLSSIRMVMSGAAPMGKELEDTVRAKLPNAKLGQGYGMTEAGPVLSMCLAFAKEPFEIKSGACGTVVRNAEMKIVDPDTGASLPRNQAGEICIRGSQIMKGYLNDPEATERTVDKQGWLHTGDIGCIDGDDELFIVDRLKELIKYKGFQVAPAELEAMLIAHPNISDAAVVPMKDEAAGEIPVAFVVRSNSSKISEDDIKQYISKQVVFYKRIGRVFFIDKIPKAPSGKILRKDLRAKLAAGLPN